Skip to main content

All Questions

Tagged with
3votes
1answer
34views

Name of algorithm that maps a string column to a float column, based on an aggregation with another float column , similar to TF-IDF

The Question I'm not super familiar with the name's of common algorithms in Data Science, and I feel like this would be something that is commonly used, and so should have a name - want to refer to ...
Mike Kennard's user avatar
1vote
4answers
75views

Deep Learning model for classifying skin diseases

I have planned to create a deep learning model that classifies skin diseases(around 5 to 7 diseases). Please suggest me a good deep learning model to go with. I am planning to integrate this model as ...
Secret's user avatar
2votes
3answers
2kviews

Which algorithm to use for transactional data

I'm given a Dataset of transactions and asked to find insights for businesses. I'm extremely new to ML / Data science and have only been experiencing with KMeans. The dataset has the following ...
Liam Louw's user avatar
1vote
0answers
505views

Pattern recognition in 2D dataset [closed]

I'm not familiar with the ways and tools of data scientists but I have a background in software engineering and mathematics. I'd need to recognize patterns in a two-dimensional data set. I have a lot ...
bencemeszaros's user avatar
3votes
1answer
4kviews

Reinforcement Learning on data only (NO emulators)

My team and I started digging into RL for the purpose of a specific application. We have plenty of data of an agent carrying out suboptimal policies (states and rewards...). It is too costly for us ...
user26616's user avatar
2votes
1answer
72views

Are there deduplication algorithms that do not work on a metric space?

Recently I got interested in the process of data cleansing and specifically in record linkage. Thus far I read about deterministic and probabilistic approaches to deduplicate data sets and to some ...
Imago's user avatar
0votes
2answers
370views

How synthetic sampling of data helps classifier to predict better result?

I am currently working on data imbalance using SMOTE for binary and other algorithms for the multi-class problem. I have the idea how to create the synthetic example to bring noticeable accuracy on a ...
BoyInDaBox89's user avatar
0votes
1answer
47views

Convert Lat /Lon of User input to Lat/Lon of Open Data

I have data from a public data set in gridded form 2.5 degree x 2.5 degree(lat,lon). Latitude goes from 90 N to -90 S. Longitude goes from 0 to 357.5. It is stored every 2.5 degrees and there are no ...
user avatar
1vote
1answer
385views

Sequence of numbers as single feature

Is it possible to use a sequence of numbers as one feature? For example, using libsvm data format: ...
pythonphobia's user avatar
1vote
1answer
92views

Versatile data structure for combined statistics

As a programmer, when you have a system/component implemented, you might want to allow some performance monitoring. For example to query how often a function call was used, how long it took and so on. ...
eckes's user avatar

close